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ACCAAACAAG AGAAGAGACT TGCTTGGGAA TATTAAT TCA AATAAAAATT AACTTAGGAT 60 

TAAAGAACTT TACCGAAAGG TAAGGGGAAA GAAATCCTAA GACTGTAATC ATGTTGAGTC 120 

TATTCGACAC ATTCAGTGCG CGTAGGCAGG AGAACATAAC GAAATCAGCT GGTGGGGCTG 180 

TTATTCCCGG GCAAAAAAAC ACTGTGTCTA TATTTGC TCT TGGACCATCA ATAACAGAT G 24 0 

ACAATGATAA AATGACATTG GCTCTTCTCT TTTTGTCTCA TTCTTTAGAC AATGAAAAGC 300 

AGCATGCGCA AAGAGCTGGA TTTTTAGTTT CTCTGTTATC AATGGCTTAT GCCAACCCAG 360 

AATTATATTT AACATCAAAT GGTAGTAATG CAGATGT TAA ATATGTTATC TACAT GATAG 420 

AGAAAGACCC AGGAAGACAG AAATATGGTG GGT TTGTCGT CAAGACTAGA GAGATGGTTT 4 60 

ATGAAAAGAC AACTGATTGG ATGTTCGGGA GTGATCT TGA GTATGATCAA GACAATATGT 54 0 

TGCAAAATGG TAGAAGCACT TCTACAATCG AGGATCT TGT TCATACTTTT GGATATCCAT 600 

CGTGTCTTGG AGCCCTTATA ATCCAAGTTT GGATAATACT TGTTAAGGCT ATAACCAGTA 6S0 

TATCAGGATT GAGGAAAGGA TTCTTTACTC GGTTAGAAGC ATTTCGACAA GATGGAACAG 720 

TTAAATCCAG TCTAGTGTTG AGCGGTGATG CAGTAGAACA AATTGGATCA ATTATGAGGT 78 0 

CCCAACAGAG CTTGGTAACA CTCATGGTTG AAACACTGAT AACAATGAAC ACAGGCAGGA 84 0 

ATGATCTGAC AACAATAGAA AAGAATATAC AGATTGTAGG AAACTACATC AGAGATGCAG 900 

GTCTTGCTTC ATTTTTCAAC ACAATCAGAT ATGGCAT TGA GACTAGAATG GCAGC TCTAA 960 

CTCTGTCTAC CCTTAGACCG GATATCAACA GACTCAAGGC ACTGATCGAG TTATATCTAT 1020 

CAAAGGGGCC ACGTGCTCCT TTTATATGCA TTTTGAGAGA TCCCGTGCAT GGTGAGTTTG 1080 

CACCAGGCAA CTATCCTGCC CTCTGGAGTT ATGCGATGGG TGTAGCAGTT GTACAAAACA 1140 

AGGCCATGCA ACAGTATGTA ACAGGAAGGT CTTATCT GGA TATTGAAATG TTCCAACTTG 1200 

GTCAAGCAGT GGCACGTGAT GCCGAGTCGC AGATGAGTTC AATATTAGAG GATGAACTGG 12 60 

GGGTCACACA AGAAGCCAAG CAAAGCTTGA AGAAACACAT GAAGAACATC AGCAGTTCAG 1320 

ATACAACCTT TCATAAGCCT ACAGGGGGAT CAGCCATAGA AATGGCGATA GATGAAGAAG 1380 

CAGGGCAGCC TGAATCCAGA GGAGATCAGG ATCAAGGAGA TGAGCCTCGG TCATCCATAG 1440 

TTCCTTATGC ATGGGCAGAC GAAACCGGGA ATGACAATCA AACTGAATCA ACTACAGAAA 1500 

TTGACAGCAT CAAAACTGAA CAAAGAAACA TCAGAGACAG GCTGAACAAA AGACTCAACG 15 60 

AGAAAAGGAA ACAGAGTGAC CCGAGATCAA CTGACATCAC AAACAACACA AATCAAACTG 1620 

AAATAGATGA TTTGTTCAGT GCATTCGGAA GCAACTAGTC ACAAAGAGAT GACCACTATC 1680 

ACCAGCAACA AGTAAGAAAA ACTTAGGATT AATGGAAATT ATCCAATCCA GAGACGGAAG 17 40 

GACAAATCCA GAATCCAACC ACAACTCAAT CAACCAAAGA TTCATGGAAG ACAATGTTCA 18 00 

AAACAATCAA ATCATGGATT CTTGGGAAGA GGGATCAGGA GATAAATCAT CTGACATCTC 18 60 

ATCGGCCCTC GACATCATTG AATTCATACT CAGCACCGAC TCCCAAGAAA ACACGGCAGA 1920 

CAGCAATGAA ATCAACACAG GAACCACAAG ACTTAGCACG ACAATCTACC AACCTGAATC 1980 

CAAAACAACA GAAACAAGCA AGGAAAATAG TGGACCAGCT AACAAAAATC GACAGTTTGG 2040 

GGCATCACAC GAACGTGCCA CAGAGACAAA AGATAGAAAT GTTAATCAGG AGACT GTACA 2100 

GGGAGGATAT AGGAGAGGAA GCAGCCCAGA TAGTAGAACT GAGACTATGG TCACTCGAAG 2160 

AATCTCCAGA AGCAGCCCAG ATCCTAACAA TGGAACCCAA ATCCAGGAAG ATATT GATTA 2220 

CAATGAAGTT GGAGAGATGG ATAAGGACTC TACTAAGAGG GAAATGCGAC AATTTAAAGA 2280 

TGTTCCAGTC AAGGTATCAG GAAGTGATGC CATTCCTCCA ACAAAACAAG ATGGAGACGG 2340 
TGATGATGGA 2350 
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AGAGGCCTGG AATCTATCAG TACATTTGAT TCAGGATATA CCAGTATAGT GACTGCCGCA 2410 

ACACTAGATG ACGAAGAAGA ACTCCTTATG AAGAACAACA GGCCAAGAAA GTATCAATCA 2470 

ACACCCCAGA ACAGTGACAA GGGAATTAAA AAAGGGGTTG GAAGGCCAAA AGACACAGAC 2530 

AAACAATCAT CAATATTGGA CTACGAACTC AACT TCAAAG GATCGAAGAA GAGCCAGAAA 2590 

ATCCTCAAAG CCAGCACGAA TACAGGAGAA CCAACAAGAC CACAGAATGG ATCCCAGGGG 2 650 

AAGAGAATCA. CATCCTGGAA CA.TCCTCAAC AGCGAGAGCG GCAATCGAAC AGAAT CAACA 2710 

AACCAAACCC ATCAGACATC AACCTCGGGA CAGAACCACA CAATGGGACC AAGCA.GAACA 2770 

ACCTCCGAAC CAAGGATCAA GACACAAAAG ACGGATGGAA AGGAAAGAGA GGACACAGAA 2830 

GAGAGCACTC GATTTACAGA AAGGGCGATT ACATTATTAC AGAATCTTGG TGTAATCCAA 2890 

TCTGCAGCAA AATTAGACCT ATACCAAGAC AAGAGAGTTG TGTGTGTGGC GAATGTCCTA 2 950 

AACAATGCAG ATACTGCATC AAAGATAGAC TTCCTAGCAG GTTTGATGAT AGGAGTGTCA 3010 

ATGOATCA-TG ATACCAAATT AAATCAGATT CAGAACGAGA TATTAAGTTT GAAAACTGAT 3070 

CTTAAAAAGA TGGATGAATC ACATAGAAGA CTAATTGAGA ATCAAAAAGA ACAATTATCA. 3130 

CTGATCACAT CATTAATCTC AAATCTTAAA ATTATGACAG AGAGAGGAGG GAAGAAGGAC 3190 

CAACCAGAAC CTAGCGGGAG GACATCCATG ATCAAGACAA AAGCAAAAGA AGAGAAAATA 32 50 

AAGAAAGTCA GGTTTGACCC TCTTATGGAA ACACAGGGCA TCGAGAAAAA CATCCCTGAC 3310 

CTCTATAGAT CAATAGAGAA AACACCAGAA AACGACACAC AGATCAAATC AGAAATAAAC 3370 

AGATTGAATG ATGAATCCAA TGCCACTAGA TTAGTACCTA GAAGAATAAG CAGTACAATG 3 430 

AGATCATTAA TAATAATCAT TAACAACAGC AATTTATCAT CAAAAGCAAA GCAATCATAC 34 90 

ATCAACGAAC TCAAGCTCTG CAAGAGTGAC GAGGAAGTGT CTGAGTTGAT GGACATGTTC 3550 

AATGAGGATG TCAGCTCCCA GTAAACCGCC AACCAAGGGT CAACACCAAG AAAACCAATA 3 610 

GCACAAAACA GCCAATCAGA GACCACCCCA ATACACCAAA CCAATCAACA CATAACAAAG 3670 

ATCTCCAGAT CATAGATGAT TAAGAAAAAC TTAGGATGAA AGGACTAATC AATCCTCCGA 3730 

AACAATGAGC ATCACCAACT CCACAATCTA CACATTCCCA GAATCCTCTT TCTCCGAGAA 3790 

TGGCAACATA GAGCCGTTAC CACTCAAGGT CAATGAACAG AGAAAGGCCA TACCTCATAT 3850 

TAGGGTTGTC AAGATAGGAG ATCCGCCCAA ACATGGATCC AGATATCTGG ATGTCTTTTT 3910 

ACTGGGCTTC TTTGAGATGG AAAGGTCAAA AGACAGGTAT GGGAGCATAA GTGATCTAGA 3970 

TGATGATCCA AGTTACAAGG TTTGTGGCTC TGGATCATTG CCACTTGGGT TGGCTAGATA 4030 

CACCGGAAAT GATCAGGAAC TCCTACAGGC TGCAACCAAG CTCGATATAG AAGTAAGAAG 4 0 90 

AACTGTAAAG GCTACGGAGA TGATAGTTTA CACTGTACAA AACATCAAAC CTGAACTATA 4150 

TCCA.TGGTCC AGTAGATTAA GAAAAGGGAT GTTATTTGAC GCTAATAAGG TTGCACTTGC 4210 

TCCTCAATGT CTTCCACTAG ATAGAGGGAT AAAATTCAGG GTGATATTTG TGAACTGCAC 4270 

AGCAATTGGA TCAATAACTC TATTCAAAAT CCCTAAGTCC ATGGCATTGT TATCATTGCC 4330 

TAATACAATA TCAATAAATC TACAAGTACA TATCAAAACA GGAGTTCAGA CAGATTCCAA 43 90 

AGGAGTAGTT CAGATTCTAG ATGAAAAAGG TGAAAAATCA CTAAATTTCA TGGTTCATCT 4450 

CGGGTTGATC AAAAGGAAGA TGGGCAGAAT GTACTCAGTT GAATATTGTA AGCAGAAGAT 4 510 

CGAGAAGATG AGATTATTAT TCTCATTGGG ATTAGTT GGA GGGATCAGCT TCCA.CGTCAA 4570 

CGCAACTGGC TCTATATCAA AGACATTAGC AAGTCAATTA GCATTCAAAA GAGAAATCTG 4 630 

CTATCCCCTA ATGGATCTGA ATCCACACTT AAATTCAGTT ATATGGGCAT CATCAGTTGA 4 690 

AATTACAAGG 4700 
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GTAGATGCAG 
GCAAAAGGGG 
ATCAGGCTAC 
CGGAAAACAG 
AGACCGAGAA 
CAAACACACC 
CAGATATGAC 
AAATAGACAT 
AGATTTCACA 
ATTCACACTC 
TAATTCCTCT 
CCCACAACAA 
CGATAGGGAT 
AGGCAAAGTC 
AATCGATTCA 
TCAACAATGA 
TGGGAATTGC 
GAACACTGAA 
TAACGGAAAT 
CTGAGTCAAT 
AAGTTAGACT 
TATCATACAA. 
AAGGGGCTTT 
TATGTCCTTC 
ACATAACACA 
TGAATGGTGG 
ATAGAATTAA 
TAATAGGTAT 
CATTTGATGA 
AACTCAACAA 
AAAAGTTAC3A 
TGATGATAAT 
ATAGAATTCA 
GACAATAAGA 
TCAACACAGC 
TGGAATATTG 
GAGGCAAACA 
TAACAATATT 
ATAATAAATT 
AGAGGACTTC 



TTCTCCAGCC 
TCGGGAAAAT 
CCACAGGAGA 
CCAAACAAAC 
AAAACAGAAC 
AACAATCCTG 
CATCACAACC 
AACAAAACTG 
AAATTTCGAA 
ATGTGGGGAT 
ATATGATGGA 
TACTAATCTT 
AGCCACTTCA 
AGACATAGAA 
AAGTTCTGTA 
AATTATACCT 
ATTGACACAA 
AGAAAAAGGG 
ATTTACTACT 
CAAGATGAGA 
TCCTTTATTA 
CATCCAGGGC 
TCTAGGTGGT 
TGATCCAGGT 
GTGTCCTAAG 
ATTAATTGCA 
TCAATCACCT 
AAACGGAATG 
CATCATATTA 
GGCAAAACTA 
TTCCGTTGGA 
AATTCTAGT T 
GGGGAAAGAT 
CTATACACGA 
AGCACCGAAT 
GAAACACACA 
TAGTAGCAAG 
ATCAGTCAT T 
AATGTTGCAG 



TTCATTACCT 
CA.GACAGTAA 
AAAATCAAAA 
CAACACACAA 
GCACACAACC 
CAAACAAGCA 
ACAATCATAG 
CAAC GTGTAG 
ACGAGATACC 
CAACAGATAA 
TTAAAATTAC 
AGGACAAAAC 
GCACAAATCA 
AAACTCAAAG 
GGTAACCTAA 
TCAATCACAA 
CATTACTCAG 
ATAAAATTAC 
TCAACAGTTG 
GTGATAGATG 
ACTAAACTAT 
AAAGAGTGGT 
GCTGATATTA 
TACATATTAA 
ACTGTTGTTA 
AACTGCATAA 
GATCAAGGAA 
TTATTCAATA 
AATAACTCTG 
GAATTAGAAG 
AGTTGGTATC 
ATAATCAATA 
CAAAACGACA 
TCAAATATAA 
AGACCAAAAG 
AACAGCATAA 
GTTACAAATA 
TTTATAATGA 
GAAATAAGAA 



GGCGAAT TCA 

aatc:aacaac 

ACTTAGGATC 
ATCACAGACA 
AAGCAGAGAA 
CCAAAACAGA 
CCA.TATTACT 
GTGTGTTAGT 
TGATATTAAG 
ACCAATACAA 
AAAAAGATGT 
GATTCTT TGG 
CCGCAGCAGT 
AGGCTATAAG 
TTGTTGCAGT 
GATTAGGCTG 
AATTAACAAA 
AAGGGATAGC 
ACCAATATGA 
TTGATTT GAG 
CAAATAC TCA 
ATATTCCT CT 
AAGAATGCAT 
ATCACGAGAT 
CATCAGATGT 
CAACTACATG 
TTAAGAT CAT 
CTAATAGAGA 
TTGCACTTAA 
AATCGAAGGA 
AATCTAGTGC 
TAACAAT TAT 
AAAACAGTGA 
AAAGTACAAA 
GCAGCGCAGA 
ATAACACCAA 
TCATAAT GTA 
TATTGACAAA 
AAGAATT CGC 



GATACTACCC 
CCTGATATCC 
AAAGGGATCA 
AAAAGGAGAA 
AAGCCAAAGC 
GGTCAAAAGA 
AATACCCCCA 
CAACAATCCT 
TTTGATACCC 
GAAGTTATTG 
AATAGTAGTA 
AGAGATAATT 
CGCTCTTGTC 
AGACACAAAC 
TAAATCAGT T 
TGAAGCAGCA 
TATATTTGGT 
ATCATTATAT 
TATTTATGAC 
TGATTACTCA 
AATTTATAAA 
TCCCAATCAC 
AGAGGCATTC 
AGAGAATTGT 
GGTACCACGA 
TACATGCAAT 
AACACATAAA 
AGGGACATTA 
TCCAATTGAT 
ATGGATAAAG 
AACAATCACC 
TGTAGTCATA 
GCCGTATATA 
AAACTTAGGA 
GGCGACACCA 
CAATGAAACC 
CACCTTCTGG 
CTTAATTCAA 
GGCAATAGAC 



AAACATCATA 
AACATTGCAA 
CCACGAACCC 
GGCACTGCAA 
CCGCCATTCA 
CAAAGAGCAC 
TCATT TTGTC 
AAAGGCATGA 
AAAATAGAGA 
GATAGATTGA 
AGTCATGAAA 
GGGACAATTG 
GAAGC TAAAC 
AAGGCAGTAC 
CAAGACTATG 
GGGTTACAAT 
GATAATATAG 
CACACAAACA 
CTATTATTCA 
ATTACTCTTC 
GTAGATTCTA 
ATCATGACAA 
AGCAGTTATA 
TTATCAGGGA 
TACGCGTTTG 
GGAAT TGACA 
GAATG CCAGG 
GCAAC TTATA 
ATATCTATGG 
AAATCAAATC 
ATAAT CATAG 
ATCAAATTCC 
CTGACAAATA 
ACAAAGTTGT 
AACTCAAAAA 
GAAACAGCCA 
ACAATAACAT 
GAGAACAATC 
ACCAAGATTC 



4940 
5000 
5060 
5120 
5180 
5240 
5300 
5360 
5420 
5480 
5540 
5600 
5660 
5720 
5780 
5840 
5900 
5960 
6020 
6080 
6140 
6200 
6260 
6320 
6380 
64 40 
6500 
6560 
6620 
6680 
6740 
6800 
6860 
6920 
6980 
7040 
7050 
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GGATGACATT GGAACCTCAA TACAGTCAGG AATAAATACA AGACTTCTCA CAATTCAGAG 7110 



TCATGTTCAA AACTATATCC CACTATCATT AACACAA.CAA ATGTCAGATC TCAGAAAATT 7170 

TATCAATGAT CTAACAAATA AAAGAGAACA. TCAAGAAGTG CCAATACAGA GAATGACTCA 7230 

TGATAGAGGT ATAGAACCCC TAAATCCAAA CAAGTTCTGG AGGTGTACAT CTGGTAACCC 72 90 

ATCTCTAACA AGTAGTCCTA AGATAAGGTT AATACCAGGA CCAGGTTTAT TAGCAACATC 7350 

TACTACAGTA AATGGCTGTA TTAGAATTCC ATCGTTAGTA ATCAATCATC TAATCTATGC 7410 

TTACACCTCT AATCTTATTA CCCAGGGCTG TCAAGATATA GGGAAATCTT ACCAAGTACT 74 70 

ACAAATAGGG ATAATTACTA TAAAT TCGGA CCTAGTACCT GAT TTAAACC CCAGAGTCAC 7530 

ACATACATTT AATATTGATG ATAATAGAAG ATCTTGCTCT CTGGCACTAT TGAATACAGA 75 90 

TGTTTATCAG TTATGCTCAA CACCAAAAGT TGATGAAAGA TCCGATTATG CATCAACAGG 7650 

TATTGAGGAT ATTGTACTTG ACATTGTCAC TAATAATGGA TTAATTATAA CAACAAGGTT 7710 

TACAAA.TAAT AATATAACTT TTGATAAACC GTATGCAGCA TTGTATCCAT CAGTGGGACC 7770 

AGGAATCTAT TATAAGGATA AAGTTATATT TCTCGGATAT GGAGGTCTAG AGCATGAAGA 7830 

AAACGGAGAC GTAATATGTA ATACAACTGG TTGTCCTGGC AAAACACAGA GAGACTGTAA 78 90 

TCAGGCTTCT TATAGCCCAT GGTTCTCAAA TAGGAGAATG GTAAACTCTA TTATTGTTGT 7950 

TGATAAAGGC ATAGATGCAA CTTTTAGCTT GAGGGTGTGG ACTATTCCAA TGAGCCAAAA 8010 

TTATTGGGGA TCAGAAGGAA GATTACTTTT ATTAGGT GAC AGAATATACA TATATACTAG 8070 

ATCCACAAGT TGGCACAGTA AATTACAGTT AGGGGTAATT GATATTTCTG ATTATACTAA 3130 

TATAAGAATA AATTGGACTT GGCATAATGT ACTATCACGG CCAGGGAATG ATGAATGTCC 8190 

ATGGGGTCAT TCATGCCCAG ACGGATGTAT AACAGGAGTT TACACTGATG CATATCCGCT 8250 

AAACCCATCG GGGAGTGTTG TATCATCAGT AATTCTTGAT TCACAAAAGT CTAGAGAAAA 83X0 

CCCAATCATT ACTTACTCAA CAGCTACAAA TAGAATAAAT GAATTAGCTA TATATAACAG 8370 

AACACTTCCA GCTGCATATA CAACAACAAA TTGTATCACA CATTATGATA AAGGGTATTG 8430 

TTTTCATATA GTAGAAATAA ATCACAGAAG TTTGAATACG TTTCAACCTA TGTTATTCAA 84 90 

AACAGAAGTT CCAAAAAACT GCAGCTAAAT TGATCATCGC ATATCGGATG CAAGATGACA 8550 

TTAAAAGAGA CCACCAGACA GACAACACAG GAGACGATGC AAGATATAAA GAAATAATAA 8 610 

AAAACTTAGG AGAAAAGTGT GCAAGAAAAA TGGACACCGA GTCCCACAGC GGCACAACAT 8670 

CTGACATTCT GTACCCTGAA TGTCACCTCA ATTCTCCTAT AGTTAAAGGA AAGATAGCAC 8730 

AACTGCATAC AATAATGAGT TTGCCTCAGC CCTACGATAT GGATGATGAT TCAATACTGA 8790 

TTATTACTAG ACAAAAAATT AAACTCAATA AATTAGATAA AAGACAACGG TCAATTAGGA 8850 

AATTAAGATC AGTCTTAATG GAAAGAGTAA GTGATCTAGG TAAATATACC TTTATCAGAT 8 910 

ATCCAGAGAT GTCTAGTGAA ATGTTCCAAT TATGTATACC CGGAATTAAT AATAAAATAA 8 970 

ATGAATTGCT AAGTAAAGCA AGTAAAACAT ATAATCAAAT GACTGATGGA TTAAGAGATC 9030 

TATGGGTTAC TATACTATCG AAGTTAGCAT CGAAAAATGA TGGAAGTAAT TATGATATCA 9090 

ATGAAGATAT TAGCAATATA TCAAATGTTC ACATGACTTA TCAATCAGAC AAATGGTATA 9150 

ATCCATTCAA GACATGGTTT ACTATTAAGT ATGACATGAG AAGATTACAA AAAGCCAAAA 9210 

ATGAGATTAC ATTCAATAGG CATAAAGATT ATAATCTATT AGAAGACCAA AAGAATATAT 9270 

TGCTGATACA TCCAGAACTC GTCTTAATAT TAGATAAACA AAATTACAAT GGGTATATAA 9330 

TGACTCCTGA ATTGGTACTA ATGTATTGTG ATGTAGTTGA AGGGAGGTGG AATATAAGTT 9390 

CATGTGCAAA 9400 
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ATTGGATCCT AAGTTACAAT CAATGTATTA TAAGGGTAAC AATTTATGGG AAATAATAGA 94 60 

TGGACTATTC TCGACCTTAG GAGAAAGAAC ATTTGACATA ATATCACTAT TAGAACCACT 9520 

TGCATTATCG CTCATTCAAA CTTATGACCC GGTTAAACAG CTCAGGGGGG CTTTTTTAAA 9580 

TCACGTGTTA TCAGAAATGG AATTAATATT TGCAGCTGAG TGTACAACAG AGGAAATACC 9640 

TAATGTGGAT TATATAGATA AAATTTTAGA TGTGTTCAAA GAATCAACAA TAGATGAAAT 9700 

AGCAGAAATT TTCTCTTTCT TCCGAACTTT TGGACACCCT CCATTAGAGG CGAGTATAGC 97 60 

AGCAGAGAAA GTTAGAAAGT ATATGTATAC TGAGAAATGC TTGAAATTTG ATACTATCAA 9820 

TAAATGTCAT GCTATTTTTT GTACAATAAT TATAAATGGA TATAGAGAAA GACATGGTGG 9880 

TCAATGGCCT CCAGTTACAT TACCTGTCCA TGCACAT GAA TTTATCATAA ATGCATACGG 9940 

ATCAAATTCT GCCATATCAT ATGAGAATGC TGTAGATTAT TATAAGAGCT TCATAGGAAT 10000 

AAAATTTGAC AAGTTTATAG AGCCTCAATT GGATGAAGAC TTAACTATTT ATATGAAAGA 10060 

TAAAGCATTA TCCCCAAAGA AATCAAACTG GGACACAGTC TATCCAGCTT CAAACCTGTT 10120 

ATACCGCACT AATGTGTCTC ATGATTCACG AAGATTGGTT GAAGTATTTA TAGCAGATAG 10180 

TAAATTTGAT CCCCACCAAG TATTAGATTA CGTAGAATCA GGATATTGGC TGGATGATCC 10240 

TGAATTTAAT ATCTCATATA GTTTAAAAGA GAAAGAAATA AAACAAGAAG GTAGACTTTT 10300 

TGCAAAAATG ACATACAAGA TGAGGGCTAC ACAAGTATTA TCAGAAACAT TATTGGCGAA 10360 

TAATATAGGG AAATTCTTCC AAGAGAATGG GATGGTTAAA GGAGAAATTG AATTACTCAA 10420 

GAGACTAACA ACAATATCTA TGTCTGGAGT TCCGCGGTAT AATGAGGTAT ACAATAATTC 10480 

AAAAAGTCAC ACAGAAGAAC TTCAAGCTTA TAATGCAATT AGCAGTTCCA ATTTATCTTC 1054 0 

TAATCAGAAG TCAAAGAAGT TTGAATTTAA ATCTACAGAT ATATACAATG ATGGATACGA 10600 

AACCGTAAGC TGCTTCTTAA CGACAGATCT TAAAAAATAT TGTTTAAATT GGAGGTATGA 10 660 

ATCAACAGCT TTATTCGGTG ATACTTGTAA TCAGATATTT GGGTTAAAGG AATTATTTAA 10 720 

TTGGCTGCAC CCTCGCCTTG AAAAGAGTAC AATATATGTT GGAGATCCTT ATTGCCCGCC 10780 

ATCAGATATT GAACATTTAC CACTTGATGA CCATCCTGAT TCAGGATTTT ATGTTCATAA 10840 

TCCTAAAGGA GGAATAGAAG GGTTTTGCCA AAAGTTATGG ACACTCATAT CTATCAGTGC 10 900 

AATACATTTA GCAGCTGTCA AAATCGGTGT AAGAGTTACT GCAATGGTTC AAGGGGATAA 10960 

TCAAGCCATA GCTGTTACCA CAAGAGTACC TAATAAT TAT GATTATAAAG TTAAGAAAGA 11020 

GATTGTTTAT AAAGATGTGG TAAGATTTTT TGATTCCTTG AGAGAGGTGA TGGATGATCT 11080 

GGGTCATGAG CTCAAACTAA ATGAAACTAt AATAAGTAGT AAAATGTTTA TATATAGCAA 11140 

AAGGATATAC TATGACGGAA GAATCCTTCC TCAGGCATTA AAAGCATTGT CTAGATGTGT 11200 

TTTTTGGTCT GAAACAATCA TAGATGAGAC AAGATCAGCA TCCTCAAATC TGGCTACATC 11260 

GTTTGCAAAG GCCATTGAGA ATGGCTACTC ACCTGTATTG GGATATGTAT GCTCAATCTT 11320 

CAAAAATATC CAACAGTTGT ATATAGCGCT TGGAATGAAT ATAAACCCAA CTATAACCCA 11380 

AAATATTAAA GATCAATATT TCAGGAATAT TCATTGGATG CAATATGCCT CCTTAATCCC 11440 

TGCTAGTGTC GGAGGATTTA ATTATATGGC CA-TGTCAAGG TGTTTTGTCA GAAACATTGG 11500 

AGATCCTACA GTCGCTGCGT TAGCCGATAT TAAAAGATTT ATAAAAGCAA ATTTGTTAGA 115 60 

TCGAGGTGTC CTTTACAGAA TTATGAATCA AGAACCAGGC GAGTCTTCTT TTTTAGACTG 11620 

GGCCTCAGAT CCCTATTCAT GTAACTTACC ACAATCTCAA. AATATAACCA CCATGATAAA 11680 

GAATATAACT GCAAGAAATG TACTACAGGA CTCACCAAAC CCATTACTAT CTGGATTATT 11740 

TACAAGTACA 11750 
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ATGA.TAGAAG AGGATGAGGA ATTAGGTGAG TTCCTAATGG ACAGGAGAAT AATCCTCCCA 11810 

AGAGTTGCAC ATGACATTTT AGATAATTCT CTTACTGGAA TTAGGAATGC TATAGCTGGT 11870 

ATGTTGGATA CAACAAAAT C ACTAATTCGA GTAGGGATAA GCAGAGGAGG ATTAACCTAT 11930" 

AACTTATTAA GAAAGATAAG CAACTATGAT CTTGTACAAT ATGAGACACT TAGTAAAACT 11990 

TTAAGACTAA TAGTCAGTGA CAAGATTAAG TATGAAGATA TGTGCTCAGT AGACCTAGCC 12050 

ATATCATTAA GACAAAiWi.T GTGGATGCAT TTATCAGGAG GAiGAATGAT AAATGGACTT 12110 

GAAACTCCAG ATCCTTTAGA GTTACTGTCT GGAGTAATAA TAACAGGATC TGAACATTGT 12170 

AGGATATGTT ATTCAACTGA AGGTGAAAGC CCATATACAT GGATGTATTT ACCAGGCAAT 12230 

CTTAATATAG GATCAGCTGA GACAGGAATA GCATCAT TAA GGGTCCCTTA CTTTGGATCA 12290 

GTTACAGATG AGAGATCTGA AGCACAATTA GGGTATATCA AAAATCTAAG CAAACCAGCT 12350 

AAGGCTGCTA TAAGAATAGC AATGATATAT ACTTGGGCAT TTGGGAATGA CGAAATATCT 12410 

TGGATGGAAG CATCACAGAT TGCACAAACA CGTGCAAACT TTACATTGGA TAGCTTAAAG 124 70 

ATTTTGACAC CAGTGACAAC ATCAACAAAT CTATCACACA GGTTAAAAGA TACTGCTACT 12530 

CAGATGAAAT TTTCTAGTAC ATCACTTATT AGAGTAAGCA GGTTCATCAC AATAT CTAAT 125 90 

GATAATATGT CTATTAAAGA AGCAAATGAA ACTAAAGATA CAAATCTTAT TTATCAACAG 12 650 

GTAATGTTAA CAGGATTAAG TGTATTTGAA TATCTATTTA GGTTAGAGGA GAGTACAGGA 12710 

CATAACCCTA TGGTCATGCA TCTACATATA GAGGATGGAT GTTGTATAAA AGAGAGTTAC 12770 

AATGATGAGC ATATCAATCC GGAGTCTACA TTAGAGTTAA TCAAATACCC TGAGAGTAAT 12830 

GAATTTATAT ATGATAAGGA CCCTTTAAAG GATATAGATC TATCAAAATT AATGGTTATA 128 90 

AGAGATCATT CTTATACAAT TGACATGAAT TACTGGGATG ACACAGATAT TGTACATGCA 12950 

ATATCAATAT GTACTGCAGT TACAATAGCA GATACAATGT CGCAGCTAGA TCGGGATAAT 13010 

CTTAAGGAGC TGGTTGTGAT TGCAAATGAT GATGATATTA ACAGTCTGAT AACTGAATTT 13070 

CTGACCCTAG ATATACTAGT GTTTCTCAAA ACATTTGGAG GGTTACTCGT GAATCAATTT 13130 

GCATATACCC TTTATGGATT GAAAATAGAA GGAAGGGATC CCATTTGGGA TTATATAATG 13190 

AGAACATTAA AAGACACCTC ACATTCAGTA CTTAAAGTAT TATCTAATGC ACTATCTCAT 13250 

CCAAAAGTGT TTAAGAGATT TTGGGATTGT GGAGTTTTGA ATCCTATTTA TGGTC CTAAT 13310 

ACTGCTAGTC AAGATCAAGT TAAGCTTGCT CTCTCGATTT GCGAGTACTC CTTGGATCTA 13370 

TTTATGAGAG AATGGTTGAA TGGAGCATCA CTTGAGATCT ATATCTGTGA TAGTGACATG 13430 

GAAATAGCAA ATGACAGAAG ACAAGCATTT CTCTCAAGAC ATCTTGCCTT TGTGTGTTGT 134 90 

TTAGCAGAGA TAGCATCTTT TGGACCAAAT TTATTAAATC TAACATATCT AGAGAGACTT 13550 

GATGAATTAA AACAATACTT AGATCTGAAC ATCAAAGAAG ATCCTACTCT TAAATATGTG 13610 

CAAGTATCAG GACTGTTAAT TAAATCATTC CCCTCAACTG TTACGTATGT AAGGAAAACT 13670 

GCGATTAAGT ATCTGAGGAT TCGTGGTATT AATCCGCCTG AAACGATTGA AGATTGGGAT 13730 

CCCATAGAAG ATGAGAATAT CTTAGACAAT ATTGTTAAAA CTGTAAATGA CAATTGCAGT 13790 

GATAATCAAA AGAGAAATAA AAGTAGTTAT TTCTGGGGAT TAGCTCTAAA GAATTATCAA 13850 

GTCGTGAAAA TAAGATCCAT AACGAGTGAT TCTGAAGTTA ATGAAGCTTC GAATGTTACT 13910 

ACACATGGAA TGACACTTCC TCAGGGAGGA AGTTATCTAT CACATCAGCT GAGGT TATTT 13970 

GGAGTAAACA GTACAAGTTG TCTTAAAGCT CTTGAATTAT CACAAATCTT AATGAGGGAA 14030 

GTTAAAAAAG ATAAAGATAG ACTCTTTTTA GGAGAAGGAG CAGGAGCTAT GTTAGCATGT 14 0 90 

TATGATGCTA 14100 
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CACTCGGTCC TGCAATAAAT TATTATAATT CTGGTTTAAA TATTACAGAT GTAAT TGGTC 14160 

AACGGGAATT AAAAATCTTC CCATCAJSAAG TATCATTAGT AGGTAAAAAA CTAGGAAATG 14220 

TAACACAGAT TCTTAATCGG GTGAGGGTGT TATTTAATGG GAATCCCAAT TCAACATGGA 14280 

TAGGAAATAT GGAATGTGAG AGTTTAATAT GGAGTGAATT AAATGATAAG TCAATTGGTT 14340 

TAGTACATTG TGACATGGAG GGAGCGATAG GCAAATCAGA AGAAACTGTT CTACATGAAC 14400 

ATTATAGTAT TATTAGGATT ACATATTTAA TCGGGGATGA TGATGTTGTC CTAGTATCAA 14460 

AAATTATACC AACTATTACT CCGAATTGGT CTAAAATACT CTATCTATAC AAGTTGTATT 14520 

GGAAGGATGT AAGTGTAGTG TCCCTTAAAA CATCCAATCC TGCCTCAACA GAGCTTTATT 14590 

TAATTTCAAA AGATGCTTAC TGTACTGTAA TGGAACCCAG TAATCTTGTT TTATCAAAAC 14 640 

TTAAAAGGAT ATCATCAATA GAAGAAAATA ATCTATTAAA GTGGATAATC TTATCAAAAA 14700 

GG;V?VGAATAA CGAGTGGTTA CAGCATGAAA TCAAAGAAGG AGAAAGGGAT TATGGGATAA 14760 

TGAGGCCATA TCATACAGCA CTGCAAATTT TTGGATTCCA AATTAACTTA AATCACTTAG 14 820 

CTAGAGAATT TTTATCAACT CCTGATTTAA CCAACATTAA TAATATAATT CAAAGTTTTA 14 880 

CAAGAACAAT TAAAGATGTT ATGTTCGAAT GGGTCAATAT CACTCATGAC AATAAAAGAC 14940 

ATAAATTAGG AGGAAGATAT AATCTATTCC CGCTTAAAAA TAAGGGGAAA TTAAGATTAT 15000 

TATCACGAAG ATTAGTACTA AGCTGGATAT CATTATCCTT ATCAACCAGA TTACTGACGG 150 60 

GCCGTTTTCC AGATGAAAAA TTTGAAAATA GGGCACAGAC CGGATATGTA TCATTGGCTG 15120 

ATATTGATTT AGAATCCTTA AAGTTATTAT CAAGAAATAT TGTCAAAAAT TACAAAGAAC 15180 

ACATAGGATT AATATCATAC TGGTTTTTGA CCAAAGAGGT CAAAATACTA ATGAAGCTTA 15240 

TAGGAGGAGT CAAACTACTA GGAATTCCTA AACAGTACAA AGAGTTAGAG GATCGATCAT 15300 

CTCAGGGTTA TGAATATGAT AATGAATTTG ATATTGATTA ATACATAAAA ACATAAAATA 15360 

AAACACCTAT TCCTCACCCA TTCACTTCCA ACAAAATGAA AAGTAAGAAA AACAT GTAAT 15420 

ATATATATAC CAAACAGAGT TTTTCTCTTG TTTGGT 1545 6 
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ACCAAACAAG AGAA.GAGACT TGCTTGGGAA TATTAATTCA AATAAAAATT AACTTAGGAT 60 
TAAAGAACTT TACCGAAAGG TAAGGGGAAA GAAATCCTAA GACTGTAATC ATGTTGAGTC 120 

TATTCGACAC ATTCAGTGCG CGTAGGCAGG AGAACATAAC GAAATCAGCT GGTGGGGCTG 180 

TTATTCCCGG GCAAAAAAAC ACTGTGTCTA TATTTGCTCT TGGACCATCA ATAACAGATG 240 

ACAATGATAA AATGACATTG GCTCTTCTCT TTTTGTCTCA TTCTTTAGAC AATGAAAAGC 300 

AGCATGCGCA AAGAGCTGGA TTTTTAGTTT CTCTGTTATC AATGGCTTAT GCCAACCCAG 360 
AATTATATTT AACATCAAAT GGTAGTAATG CAGATGT TAA ATATGTTATC TACATGATAG 420 

AGAAAGACCC AGGAAGACAG AAATATGGTG GGTTTGTCGT CAAGACTAGA GAGAT GGTTT 4 80 

ATGAAAAGAC AACTGATTGG ATGTTCGGGA GTGATCT TGA GTATGATCAA GACAATATGT 540 
TGCAAAATGG TAGAAGCACT TCTACAATCG AGGATCTTGT TCATACTTTT GGATATCCAT 600 
CGTGTCTTGG AGCCCTTATA ATCCAAGTTT GGATAATACT TGTTAAGGCT ATAACCAGTA 660 
TATCAGGATT GAGGAAAGGA TTCTTTACTC GGTTAGAAGC ATTTCGACAA GATGGAACAG 720 
TTAAATCCAG TCTAGTGTTG AGCGGTGATG CAGTAGAACA AATTGGATCA ATTAT GAGGT 780 
CCCAACAGAG CTTGGTAACA CTCATGGTTG AAAXLACT GAT AACAATGAAC ACAGGCAGGA 84 0 

ATGATCTGAC AACAATAGAA AAGAATATAC AGATTGTAGG AAACTACATC AGAGATGCAG 900 
GTCTTGCTTC ATTTTTCAAC ACAATCAGAT ATGGCATTGA GACTAGAATG GCAGCTCTAA 960 

CTCTGTCTAC CCTTAGACCG GATATCAACA GACTCAAGGC ACTGATCGAG TTATATCTAT 1020 

CAAAGGGGCC ACGTGCTCCT TTTATATGCA TTTTGAGAGA TCCCGTGCAT GGTGAGTTTG 1080 

CACCAGGCAA CTATCCTGCC CTCTGGAGTT ATGCGATGG6 TGTAGCAGTT GTACAAAACA 114 0 

AGGCCATGCA ACAGTATGTA ACAGGAAGGT CTTATCTGGA TATTGAAATG TTCCAACTTG 1200 

GTCAAGCAGT GGCACGTGAT GCCGAGTCGC AGATGAGTTC AATATTAGAG GATGAACTGG 1260 

GGGTCACACA AGAAGCCAAG CAAAGCTTGA AGAAACACAT GAAGAACATC AGCAGTTCAG 1320 

ATACAACCTT TCATAAGCCT ACAGGGGGAT CAGCCATAGA AATGGCGATA GATGAAGAAG 1380 

CAGGGCAGCC TGAATCCAGA GGAGATCAGG ATCAAGGAGA TGAGCCTCGG TCATCCATAG 1440 

TTCCTTATGC ATGGGCAGAC GAAACCGGGA ATGACAATCA AACTGAATCA ACTACAGAAA 1500 

TTGACAGCAT CAAAACTGAA CAAAGAAACA TCAGAGACAG GCTGAACAAA AGACTCAACG 15 60 

AGAAAAGGAA ACAGAGTGAC CCGAGATCAA CTGACATCAC AAACAACACA AATCAAACTG 1620 

AAATAGATGA TTTGTTCAGT GCATTCGGAA. GCAACTAGTC ACAAAGAGAT GACCACTATC 1 680 

ACCAGCAACA AGTAAGAAAA ACTTAGGATT AATGGAAATT ATCCAATCCA GAGACGGAAG 1740 

GACAAATCCA GAATCCAACC ACAACTCAAT CAACCAAAGA TTCA.TGGAAG ACAATGTTCA 1800 

AAACAATCAA ATCATGGATT CTTGGGAAGA GGGATCAGGA GATAAATCAT CTGACATCTC 18 60 

ATCGGCCCTC GACATCATTG AATTCATACT CAGCACCGAC TCCCAAGAAA ACACGGCAGA 1 920 

CAGCAATGAA ATCAACACAG GAACCACAAG ACTTAGCACG ACAATCTACC AACCTGAATC 1980 

CAAAACAACA GAAACAAGCA AGGAAAATAG TGGACCAGCT AACAAAAATC QACAGTTTGG 2040 

GGCATCACAC GAACGTGCCA CAGAGACAAA AGATAGAAAT GTTAATCAGG AGACTGTACA 2100 

GGGAGGATAT AGGAGAGGAA GCAGCCCAGA TAGTAGAACT GAGACTATGG TCACTCGAAG 2160 

AATCTCCAGA AGCAGCCCAG ATCCTAACAA TGGAACCCAA ATCCAGGAAG ATATTGATTA 2220 

CAATGAAGTT GGAGAGATGG ATAAGGACTC TACTAAGAGG GAAATGCGAC AATTTAAAGA 22B0 

TGTTCCAGTC AAGGTATCAG GAAGTGATGC CATTCCTCCA ACAAAACAAG ATGGAGACGG 2340 

TGATGATGGA 2350 
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AGAGGCCTGG AATCTATCAG TACATTTGAT TCAGGATATA CCAGTATAGT GACTGCCGCA 2410 

ACACTAGATG ACGAAGAAGA ACTCCTTATG AAGAACAACA GGCCAAGAAA GTATCAATCA 2470 

ACACCCCAGA ACAGTGACAA GGGAATTAAA AAAGGGGTTG GAAGGCCAAA AGACACAGAC 2530 

AAACAATCAT CAATATTGGA CTACGAACTC AACTTCAAAG GATCGAAGAA GAGCCAGAAA 25 90 

ATCCTCAAAG CCAGCACGAA TACAGGAGAA CCAACAAGAC CACAGAATGG ATCCCAGGGG 2650 

AAGAGAATCA CATCCTGGAA CATCCTCAAC AGCGAGAGCG GCAATCGAAC AGAATCAACA 2710 

AACCAAACCC ATCAGACATC AACCTCGGGA CAGAACCACA CAATGGGACC AAGCAGAACA 277 0 

ACCTCCGAAC CAAGGATCAA GACACAAAAG ACGGATGGAA AGGAAA.GAGA GGACACAGAA 2830 

GAGAGCACTC GATTTACAGA AAGGGCGATT ACATTAT TAG AGAATCTTGG TGTAATCCAA 2890 

TCTGCAGCAA AATTAGACCT ATACCAAGAC AAGAGAGTTG TGTGTGTGGC GAATGTCCTA 2950 

AACAATGCAG ATACTGCATC AAAGATAGAC TTCCTAGCAG GTTTGATGAT AGGAGTGTCA 3010 

ATGGATCATG ATACCAAATT AAATCAGATT CAGAACGAGA TATTAAGTTT GAAAACTGAT 3070 

CTTAAAAAGA TGGATGAATC ACATAGAAGA CTAATTGAGA ATCAAAAAGA ACAATTATCA 3130 

CTGATCACAT CATTAATCTC AAATCTTAAA ATTATGACAG AGAGAGGAGG GAAGAAGGAC 3190 

CAACCAGAAC CTAGCGGGAG GACATCCA.TG ATCAAGACAA AAGCAAAAGA AGAGAAAATA 3250 ' 

AAGAAAGTCA GGTTTGACCC TCTTATGGAA ACACAGGGCA TCGAGAAAAA CATCCCTGAC 3310 

CTCTATAGAT CAATAGAGAA AACACCAGAA AACGACACAC AGATCAAATC AGAAATAAAC 3370 

AGATTGAATG ATGAATCCAA TGCCACTAGA TTAGTACCTA GAAGAATAAG CAGTACAATG 3430 

AGATCATTAA TAATAATCAT TAACAACAGC AATTTATCAT CAAAAGCAAA GCAATCATAC 34 90 

ATCAACGAAC TCAAGCTCTG CAAGAGTGAC GAGGAAGTGT CTGAGTTGAT GGACATGTTC 3550 

AATGAGGATG TCAGCTCCCA GTAAACCGCC AACCAAGGGT CAACACCAAG AAAACCAATA 3610 

GCACAAAACA GCCAATCAGA GACCACCCCA ATACACCAAA CCAATCAACA. CATAACAAAG 3670 

ATCTCCAGAT CA.TAGATGAT TAAGAAAAAC TTAGGATGAA AGGACTAATC AATCCTCCGA 3730 

AACAATGAGC ATCACCAACT CCACAATCTA CA.CA.TTCCCA GAATCCTCTT TCTCCGAGAA 3790 

TGGCAACATA GAGCCGTTAC CACTCAAGGT CAATGAACAG AGAAAGGCCA TACCTCATAT 3850 

TAGGGTTGTC AAGATAGGAG ATCCGCCCAA ACATGGATCC AGATATCTGG ATGTCTTTTT 3910 

ACTGGGCTTC TTTGAGATGG AAAGGTCAAA AGACAGGTAT GGGAGCATAA GTGATCTAGA 3970 

TGATGATCCA AGTTACAAGG TTTGTGGCTC TGGATCATTG CCACTTGGGT TGGCTAGATA 4 030 

CACCGGAAAT GATCAGGAAC TCCTACAGGC TGCAACCAAG CTCGATATAG AAGTAAGAAG 4090 

AACTGTAAAG GCTACGGAGA TGATAGTTTA CACTGTACAA AACATCAAAC CTGAACTATA 4150 

TCCATGGTCC AGTAGATTAA GAAAAGGGAT GTTATTTGAC GCTAATAAGG TTGCACTTGC 4210 

TCCTCAATGT CTTCCACTAG ATAGAGGGAT AAAATTCAGG GTGATATTTG TGAACTGCAC 4270 

AGCAATTGGA TCAATAACTC TATTCAAAAT CCCTAAGTCC ATGGCATTGT TATCATTGCC 4330 

TAATACAATA TCAATAAATC TACAAGTACA TATCAAAACA GGAGTTCAGA CAGAT TCCAA 4390 

AGGAGTAGTT CAGATTCTAG ATGAAAAAGG TGAAAAATCA CTAAATTTCA TGGTTCATCT 44 50 

CGGGTTGATC AAAAGGAAGA TGGGCAGAAT GTACTCAGTT GAATATTGTA AGCAGAAGAT 4510 

CGAGAAGATG AGATTATTAT TCTCATTGGG ATTAGTTGGA GGGATCAGCT TCCACGTCAA 4570 

CGCAACTGGC TCTATATCAA AGACATTAGC AAGTCAATTA GCATTCAAAA GAGAAATCTG 4 630 

CTATCCCCTA ATGGATCTGA ATCCACACTT AAATTCAGTT ATATGGGCAT CATCAGTTGA 4 690 

AATTACAAGG 4700 
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GTAGATGCAG TTCTCCAGCC TTCATTACCT GGCGAAT TCA GATACTACCC AAACATCATA 4760 

GCAAAAGGGG TCGGGAAAAT CAGACAGTAA AATCAACAAC CCTGATATCC AACATTGCAA 4 820 

ATCAGGCTAC CCACAGGAGA AAAATCAAAA ACTTAGGATC AAAGGGATCA CCACGAACCC 4880 

CGGAAAACAG CCAAACAAAC CAACACACAA ATCACAGACA AAAA.GGAGAA GGCAGTGCAA 4 94 0 

AGACCGAGAA AAAACAGAAC GCACACAACC AAGCAGAGAA AAGCCAAAGC CCGCCATTCA 5000 

CAAACACACC AACAATCCTG CAAACAAGCA CCAAAACAGA GGTCAAAAGA CAAAGAGCAC 5060 

CAGATATGAC CATCACAACC ACAATCATAG CCATATTACT AATACCCCCA TCATT TTGTC 5120 

AAA.TAGACAT AACAAAACTG CAACGTGTAG GTGTGTTAGT CAACAATCCT AAAGGCATGA 5180 

AGATTTCACA AAATTTCGAA ACGAGATACC TGATATTAAG TTTGATACCC AAAATAGAGA 5240 

ATTCACACTC ATGTGGGGAT CAACAGATAA ACCAATACAA GAAGTTATTG GATAGATTGA 5300 

TAATTCCTCT ATATGATGGA TTAAAATTAC AAAAAGATGT AATAGTAGTA AGTCATGAAA 5360 

CCCACAACAA TACTAATCTT AGGACAfiAAC GATTCTT TGG AGAGATAATT GGGACAATTG 5420 

CGATAGGGAT AGCCACTTCA GCACAAA.TCA CCGCAGCAGT CGCTCTTGTC GAAGCTAAAC 54 80 

AGGCAAAGTC AGACATAGAA AAACTCAAAG AGGCTATAAG AGACACAAAC AAGGCAGTAC 554 0 

AATCGATTCA AAGTTCTGTA GGTAACCTAA TTGTTGCAGT TAAATCAGTT CAAGACTATG 5 600 

TCAACAATGA AATTATACCT TCAATCACAA GATTAGGCTG TGAAGCAGCA GGGTTACAAT 5 660 

TGGGAATTGC ATTGACACAA CATTACTCAG AATTAACAAA TATATTTGGT GATAATATAG 5720 

GAACACTGAA AGAAAAAGGG ATAAA&TTAC AAGGGATAGC ATCATTATAT CACACAAACA 5760 

TAACGGAAAT ATTTACTACT TCAACAGTTG ACCAATATGA TATTTATGAC CTATTATTCA 5840 

CTGAGTCAAT CAft.GATGAGA GTGATAGATG TTGATTTGAG TGATTACTCA ATTACTCTTC 5 900 

AAGTTAGACT TCCTTTATTA ACTAAACTAT CAAATACTCA AATTTATAAA GTAGATTCTA 5 960 

TATCATACAA CATCCAGGGC AAAGAGTGGT ATATTCCTCT TCCCAATCAC ATCATGACAA 6020 

AAGGGGCTTT TCTAGGTGGT GCTGATATTA AAGAATGCAT AGAGGCATTC AGCAGTTATA 6080 

TATGTCCTTC TGATCCAGGT TACATATTAA ATCACGAGAT AGAGAATTGT TTATCAGGGA 6140 

ACATAACACA GTGTCCTAAG ACTGTTGTTA CATCAGATGT GGTACCACGA TACGCGTTTG 6200 

TGAATGGTGG ATTAATTGCA AACTGCATAA CAACTACATG TACATGCAAT GGAATTGACA 62 60 

ATAGAATTAA TCAATCACCT GATCAAGGAA TTAAGATCAT AACACATAAA GAATGCCAGG 6320 

TAATAGGTAT AAACGGAATG TTATTCAATA CTAATAGAGA AGGGACATTA GCAACTTATA 6380 

CATTTGATGA CATCATATTA AATAACTCTG TTGCACTTAA TCCAATTGAT ATATCTATGG 644 0 

AACTCAACAA GGCAAAACTA GAATTAGAAG AATCGAAGGA ATGGATAAAG AAATCAAATC 6500 

AAAAGTTAGA TTCCGTTGGA AGTTGGTATC AATCTAGTGC AACAATCACC ATAATCATAG 6560 

TGATGATAAT AATTCTAGTT ATAATCAATA TAACAAT TAT TGTAGTCATA ATCAAATTCC 6620 

ATAGAATTCA GGGGAAAGAT CAAAA.CGACA AAAACAGTGA GCCGTATATA CTGACAAATA 6680 

GACAATAAGA CTATACACGA TCAAATATAA AAAGTACAAA AAACTTAGGA ACAAAGTTGT 674 0 

TCAACACAGC AGCACCGAAT AGACCAAAAG GCAGCGCAGA GGCGACACCA AACTCAAAAA 6800 

TGGAATATTG GAAACACACA AACAGCATAA ATAACACCAA CAATGAAACC GAAACAGCCA 68 60 

GAGGCAAACA TAGTAGCAAG GTTACAAATA TCATAATGTA CACCTTCTGG ACAATAACAT 6920 

TAACAATATT ATCAGTCATT TTTATAATGA TATTGACAAA CTTAATTCAA GAGAACAATC 6980 

ATAATAAATT AATGTTGCAG GAAATAAGAA AAGAATTCGC GGCAATAGAC ACCAAGATTC 7040 
AGAGGACTTC 
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GGATGACATT GGAA.CCTCAA TACAGTCAGG AATAAATACA AGACTTCTCA CAATTCAGAG 7110 

TCATGTTCAA AACTATATCC CACTATCATT AACACAACAA ATGTCAGATC TCAGAAAATT 7170 

TATCAATGAT CTAACAAATA AAAGAGAACA TCAAGAAGTG CCAATACAGA GAATGACTCA 7230 

TGATAGAGGT ATAGAACCCC TAAATCCAAA CAAGTTCTGG AGGTGTACAT CTGGTAACCC 7290 

ATCTCTAACA AGTAGTCCTA AGATAAGGTT AATACCAGGA CCAGGTTTAT TAGCAACATC 7350 

TACTACAGTA AATGGCTGTA TTAGAATTCC ATCGTTAGTA ATCAATCATC TAATCTATGC 7410 

TTACACCTCT AATCTTATTA CCCAGGGCTG TCAkGATATA GGGAAATCTT ACCAAGTACT 7470 

ACAAATAGGG ATAATTACTA TAAAT TCGGA CCTAGTACCT GATTTAAACC CCAGAGTCAC 7530 

ACATACATTT AATATTGATG ATAATAGAAG ATCTTGCTCT CTGGCACTAT TGAATACAGA 75 90 

TGTTTATCAG TTATGCTCAA CACCAAAAGT TGATGAAAGA TCCGATTATG CATCAACAGG 7 650 

TATTGAGGAT ATTGTACTTG ACATTGTCAC TAATAATGGA TTAATTATAA CAACAAGGTT 7710 

TACAAATAAT AATATAACTT TTGATAAACC GTATGCAGCA TTGTATCCAT CAGTGGGACC 7770 

AGGAATCTAT TATAAGGATA AAGTTATATT TCTCGGATAT GGAGGTCTAG AGCATGAAGA 7830 

AAACGGAGAC GTAATATGTA ATACAACTGG TTGTCCTGGC AAAACACAGA GAGACTGTAA 7890 

TCAGGCTTCT TATAGCCCA.T GGTTCTCAAA TAGGAGAATG GTAAACTCTA TTATTGTTGT 7 950 

TGATAAAGGC ATAGATGCAA CTTTTAGCTT GAGGGTGTGG ACTATTCCAA TGAGCCAA?^ 8010 

TTATTGGGGA TCAGAAGGAA GATTACTTTT ATTAGGTGAC AGAATATACA TATATACTAG 8070 

ATCCACAAGT TGGCACAGTA AATTACAGTT AGGGGTAATT GATATTTCTG ATTATACTAA 8130 

TATAAGAATA AATTGGACTT GGCATAATGT ACTATCACGG CCAGGGAATG ATGAATGTCC 8190 

ATGGGGTCAT TCATGCCCAG ACGGATGTAT AACAGGAGTT TACACTGATG CATATCCGCT 8250 

AAACCCATCG GGGAGTGTTG TATCATCAGT AATTCTTGAT TCACAAAAGT CTAGAGAAAA 8310 

CCCAATCATT ACTTACTCAA CAGCTACAAA TAGAATAAAT GAATTAGCTA TATATAACAG 8370 

AACACTTCCA GCTGCATATA CAACAACAAA TTGTATCACA CATTATGATA AAGGGTATTG 8430 

TTTTCATATA GTAGAAATAA ATCACAGAAG TTTGAATACG TTTCAACCTA TGTTATTCAA 8490 

AACAGAAGTT CCAAAAAACT GCA.GCTAAAT TGATCATCGC ATATCGGATG CAAGATGACA 8550 

TTAAAAGAGA CCACCAGACA GACAACACAG GAGACGATGC AAGATATAAA GAAATAATAA 8 610 

AAAACTTAGG AGAAAAGTGT GCAAGAAAAA TGGACACCGA GTCCCACAGC GGCACAACAT 8670 

CTGACATTCT GTACCCTGAA TGTCACCTCA ATTCTCCTAT AGTTAAAGGA AAGATAGCAC 8730 

AACTGCATAC AATAATGAGT TTGCCTCAGC CCTACGATAT GGATGATGAT TCAATACTGA 8790 

TTATTACTAG ACAAAAAATT AAACTCAATA AATTAGATAA AAGACAACGG TCAATTAGGA 8850 

AATTAAGATC AGTCTTAATG GAAAGAGTAA GTGATCTAGG TAAATATACC TTTATCAGAT 8910 

ATCCAGAGAT GTCTAGTGAA ATGTTCCAAT TATGTATACC CGGAATTAAT AATAAAATAA 8 970 

ATGAATTGCT AAGTAAAGCA AGTAAAACAT ATAATCAAAT GACTGATGGA TTAAGAGATC 9030 

TATGGGTTAC TATACTATCG AAGTTAGCAT CGAAAAATGA TGGAAGTAAT TATGATATCA 9090 

ATGAAGATAT TAGCAATATA TCAAATGTTC ACATGACTTA TCAATCAGAC AAATGGTATA 9150 

ATCCATTCAA GACATGGTTT ACTATTAAGT ATGACATGAG AAGATTACAA AAAGCCAAAA 9210 

ATGAGATTAC ATTCAATAGG CATAAAGATT ATAATCTATT AGAAGACCAA AAGAATATAT 9270 

TGCTGATACA TCCAGAACTC GTCTTAATAT TAGATAAACA AAATTACAAT GGGTATATAA 9330 

TGACTCCTGA ATTGGTACTA ATGTATTGTG ATGTAGTTGA AGGGAGGTGG AATATAAGTT 9390 

CATGTGCAAA 94 00 
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ATTGGATCCT AAGTTACAAT CAATGTATTA TAAGGGTAAC AATTTATGGG AAATAATAGA 94 60 

TGGACTATTC TCGACCTTAG GAGAAAGAAC ATTTGACATA ATATCACTAT TAGAACCACT 9520 

TGCATTATCG CTCATTCAAA CTTATGACCC GGTTAAACAG CTCAGGGGGG CTTTTTTAAA 9580 

TCACGTGTTA TCAGAAATGG AATTAATATT TGCAGCTGAG TGTACAACAG AGGAAATACC 9640 

TAATGTGGAT TATATAGATA AAATTTTAGA TGTGTTCAAA GAATCAACAA TAGATGAAAT 9700 

AGCAGAiUVTT TTCTCTTTCT TCCGAACTTT TGGACACCCT CCATTAGAGG CGAGTATAGC 9760 

AGCAGAGAAA GTTAGAAAGT ATATGTATAC TGAGAAATGC TTGAAATTTG ATACTATCAA 9820 

TAAATGTCAT GCTATTTTTT GTACAATAAT TATAAATGGA TATAGAGAAA GACATGGTGG 9880 

TCAATGGCCT CCAGTTACAT TACCTGTCCA TGCACATGAA TTTATCATAA ATGCATACGG 9940 

ATCAAATTCT GCCATATCAT ATGAGAATGC TGTAGAT TAT TATAAGAGCT TCATAGGAAT 10000 

AAAATTTGAC AAGTTTATAG AGCCTCAATT GGATGAA.GAC TTAACTATTT ATATGAAAGA 10060 

TAAAGCATTA TCCCCAAAGA AATCAAACTG GGACACAGTC TATCCAGCTT CAAACCTGTT 10120 

ATACCGCACT AATGTGTCTC ATGATTCACG AAGATTGGTT GAAGTATTTA TAGCAGATAG 10180 

TAAATTTGAT CCCCACCAAG TATTAGATTA CGTAGAA.TCA GGATATTGGC TGGATGATCC 10240 

TGAATTTAAT ATCTCATATA GTTTAAAAGA GAAAGAAATA AAACAAGAAG GTAGACTTTT 10300 

TGCAAAAATG ACATACAAGA TGAGGGCTAC ACAAGTATTA TCAGAAACAT TATTGGCGAA 10360 

TAATATAGGG AAATTCTTCC AAGAGAATGG GATGGTTAAA GGAGAAATTG AATTACTCAA 10420 

GAGACTAACA ACAATATCTA TGTCTGGAGT TCCGCGGTAT AATGAGGTAT ACAATAATTC 10480 

AAAAAGTCAC ACAGAAGAAC TTCAAGCTTA TAATGCAATT AGCAGTTCCA ATTTATCTTC 1054 0 

TAATCAGAAG TCAAAGAAGT TTGAATTTAA ATCTACAGAT ATATACAATG ATGGATACGA 10600 

AACCGTAAGC TGCTTCTTAA CGACAGATCT TAAAAAATAT TGTTTAAATT GGAGGTATGA 10660 

ATCAACAGCT TTATTCGGTG ATACTTGTAA TCAGATATTT GGGTTAAAGG AATTATTTAA 10 720 

TTGGCTGCAC CCTCGCCTTG AAAAGAGTAC AATATATGTT G6AGATCCTT ATTGCCCGCC 10780 

ATCAGATATT GAA.CATTTAC CACTTGATGA CCATCCTGAT TCAGGATTTT ATGTTCATAA 10840 

TCCTAAAGGA GGAATAGAAG GGTTTTGCCA AAAGTTATGG ACACTCATAT CTATCAGTGC 10900 

AATACATTTA GCAGCTGTCA AAATCGGTGT AAGAGTTACT GCAATGGTTC AAGGGGATAA 10960 

TCAAGCCATA GCTGTTACCA CAAGAGTACC TAATAATTAT GATTATAAAG TTAAGAAAGA 11020 

GATTGTTTAT AAAGATGTGG TAAGATTTTT TGATTCCTTG AGAGAGGTGA TGGATGATCT 11080 

GGGTCATGAG CTCAAACTAA ATGAAACTAT AATAAGTAGT AAAATGTTTA TATATAGCAA 11140 

AAGGATATAC TATGACGGAA GAATCCTTCC TCAGGCATTA AAAGCATTGT CTAGATGTGT 11200 

TTTTTGGTCT GAAACAATCA TAGATGAGAC AAGATCAGCA TCCTCAAATC TGGCTACATC 11260 

GTTTGCAAAG GCCATTGAGA ATGGCTACTC ACCTGTATTG GGATATGTAT GCTCAATCTT 11320 

CAAAAATATC CAACA6TT6T ATATAGCGCT TGGAATGAAT ATAAACCCAA CTATAACCCA 11380 

AAATATTAAA GATCAA.TATT TCAGGAATAT TCATTGGATG CAATATGCCT CCTTAATCCC 11440 

TGCTAGTGTC GGAGGATTTA ATTATATGGC CATGTCAAGG TGTTTTGTCA GAAACATTGG 11500 

AGATCCTACA GTCGCTGCGT TAGCCGATAT TAAAAGATTT ATAAAAGCAft. ATTTGTTAGA 115 6 0 

TCGAGGTGTC CTTTACAGAA TTATGAATCA AGAACCAGGC GAGTCTTCTT TTTTAGACTG 11620 

GGCCTCAGAT CCCTATTCAT GTAACTTACC ACAATCTCAA AATATAACCA CCATGATAAA 11680 

GAA.TATAACT GCAAGAAATG TACTACAGGA CTCACCAAAC CCATTACTAT CTGGATTATT 11740 

TACAAGTACA 11750 
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SEQ ID NO: 36 



ATGATAGAAG AGGATGAGGA ATTAGCTGAG TTCCTAATGG ACAGGAGAAT AATCCTCCCA 11850 

AGAGTTGCAC ATGACATTT T AGATAATTCT CTTACTGGAA TTAGGAATGC TATAGCTGGT 11870 

ATGTTGGATA CAACAAAATC ACTAATTCGA GTAGGGATAA GCAGAGGAGG ATTAACCTAT 11930 

AACTTATTAA GAAAGATAAG CAACTATGAT CTTGTACAAT ATGAGACACT TAGTAAAACT 11990 

TTAAGACTAA TAGTCAGTGA CAAGATTAAG TATGAAGATA TGTGCTCAGT AGACCTAGCC 12050 

ATATCATTAA GACAAAAAAT GTGGATGCAT TTATCAGGAG GAAGAATGAT AAATGGACTT 12110 

GAAACTCCAG ATCCTTTAGA GTTACTGTCT GGAGTAATAA TAACAGGATC TGAACATTGT 12170 

AGGATATGTT ATTCAACTGA AGGTGAAAGC CCATATACAT GGATGTATTT ACCAGGCAAT 12230 

CTTAATATAG GATCAGCTGA GACAGGAATA GCATCAT TAA GGGTCCCTTA CTTTGGATCA 12290 

GTTACAGATG AGAQATCTGA AGCACAATTA GGGTATATCA AAAATCTAAG CAAACCAGCT 12350 

AAGGCTGCTA TAAGAATAGC AATGATATAT ACTTGGGCAT TTGGGAATGA CGAAATATCT 12410 

TGGATGGAAG CATCACAGAT TGCACAAACA CGTGCAAACT TTACATTGGA TAGCTTAAAG 12470 

ATTTTGACAC CAGTGACAAC ATCAACAAAT CTATCACACA GGTTAAAAGA TACTGCTACT 12530 

CAGATGAAAT TTTCTAGTAC ATCACTTATT AGAGTAAGCA GGTTCATCAC AATATCTAAT 12590 

GATAATATGT CTATTAAAGA AGCAAATGAA ACTAAAGATA CAAATCTTAT TTATCAACAG 12 650 

GTAATGTTAA CAGGATTAAG TGTATTTGAA TATCTATTTA GGTTAGAGGA GAGTACAGGA 12710 

CATAACCCTA TGGTCATGCA TCTACATATA GAGGATGGAT GTTGTATAAA AGAGAGTTAC 12770 

AATGATGAGC ATATCAATCC GGAGTCTACA TTAGAGTTAA TCAAATACCC TGAGAGTAAT 12830 

GAATTTATAT ATGATAAGGA CCCTTTAAAG GATATAGATC TATCAAAATT AATGGTTATA 12890 

AGAGATCATT CTTATACAAT TGACATGAAT TACTGGGATG ACACAGATAT TGTACATGCA 12950 

ATATCAATAT GTACTGCAGT TACAATAGCA GATACAA.TGT CGCAGCTAGA TCGGGATAAT 13010 

CTTAAGGAGC TGGTTGTGAT TGCAAATGAT GATGATATTA ACAGTCTGAT AACTGAATTT 13070 

CTGACCCTAG ATATACTAGT GTTTCTCAAA ACATTTGQAG GGTTACTCGT GAATCAATTT 13130 

GCATATACCC TTTATGGATT GAAAATAGAA GGAAGGGATC CCATTTGGGA TTATATAATG 13190 

AGAACATTAA AAGACACCTC ACATTCAGTA CTTAAAGTAT TATCTAATGC ACTATCTCAT 13250 

CCAAAAGTGT TTAAGAGATT TTGGGATTGT GGAGTTTTGA ATCCTATTTA TGGTCCTAAT 13310 

ACTGCTAGTC AAGATCAAGT TAAGCTTGCT CTCTCGATTT GCGAGTACTC CTTGGATCTA 13370 

TTTATGAGAG AATGGTTGAA TGGAGCATCA CTTGAGATCT ATATCTGTGA TAGTGACATG 13430 

GAAATAGCAA ATGACACaAG ACAAGCATTT CTCTCAAGAC ATCTTGCCTT TGTGTGTTGT 13490 

TTAGCAGAGA TAGCATCTTT TGGACCAAAT TTATTAAATC TAACATATCT AGAGAGACTT 13550 

GATGAATTAA AACAATACTT AGATCTGAAC ATCAAAGAAG ATCCTACTCT TAAATATGTG 13610 

CAAGTATCAG GACTGTTAAT TAAATCATTC CCCTCAACTG TTACGTATGT AAGGAAAACT 13670 

GCGATTAAGT ATCTGAGGAT TCGTGGTATT AATCCGCCTG AAACGATTGA AGATTGGGAT 13730 

CCCATAGAAG ATGAGAATAT CTTAGACAAT ATTGTTAAAA CTGTAAATGA CAATTGCAGT 137 90 

GATAATCAAA AGAGAAATAA AAGTAGTTAT TTCTGGGGAT TAGCTCTAAA GAATTATCAA. 13850 

GTCGTGAAAA TAAGATCCAT AACGAGTGAT TCTGAAGTTA ATGAAGCTTC GAATGTTACT 13910 

ACACATGGAA TGACACTTCC TCAGGGAGGA AGTTATCTAT CACATCAGCT GAGGTTATTT 13970 

GGAGTAAACA GTACAAGTTG TCTTAAAGCT CTTGAATTAT CACAAATCTT AATGAGGGAA 14030 

GTTAAAAAAG ATAAAGATAG ACTCTTTTTA GGAGAAGGAG CAGGAGCTAT GTTAGCATGT 14090 

TATGATGCTA 14100 
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SEQ ID NO: 36 












CACTCGGTCC 


TGCAATAAAT 


TATTATAATT 


CTGGTTTAAA 


TATTACAGAT 


GTAAT TGGTC 


14160 


AACGGGAATT 


AAAAATCTTC 


CCATCAGAAG 


TATCATTAGT 


AGGTAAAAAA. 


CTAGGAAATG 


14220 


TAft.CACAC5AT 


TCTTAATCGG 


GTGAGGGTGT 


TATTTAATGG 


GAATCCCAAT 


TCAACATGGA 


14280 


TAGGAAATAT 


GGAA.TGTGAG 


AGTTTAATAT 


GGAGTGAATT 


AAATGATAAG 


TCAATTGGTT 


14340 


TAGTACATTG 


TGACATGGAG 


GGAGCGATAG 


GCAAATCAGA 


AGAAACTGTT 


CTACATGAAC 


14400 


ATTATAGTAT 


TATTAGGAT T 


ACATATTTAA 


TCGGGGATGA 


TGATGTTGTC 


CTAGTATCAA 


14460 


AAATTATACC 


AACTATTACT 


CCGAATTGGT 


CTAAAATACT 


CTATCTATAC 


AAGTT GTATT 


14520 


GGAAGGATGT 


AAGTGTAGT G 


TCCCTTAAAA 


CATCCAATCC 


TGCCTCAACA 


GAGCT TTATT 


14580 


TAATTTCAAA 


AGATGCTTAC 


TGTACTGTAA 


TGGAACCCAG 


TAATCTTGTT 


TTATCAAAAC 


14640 


TTAAAAGGAT 


ATCATCAATA 


GAAGAAAATA 


ATCTATTAAA 


GTGGATAATC 


TTATCAAAAA 


14700 


GGAAGAA.TAA 


CGAGTGGTTA 


CAGCATGAAA 


TCAAAGAAGG 


AGAAAGGGAT 


TATGGGATAA 


14760 


TGAGGCCATA 


TCATACAGCA 


CTGCAAATTT 


TTGGATT CCA 


AATTAACTTA 


AATCACTTAG 


14820 


CTAGAGAATT 


TTTATCAACT 


CCTGATTTAA 


CCAACATTAA 


TAATATAATT 


CAAAGTTTTA 


14880 


CAAGAACAAT 


TAAAGATGTT 


ATGTTCGAAT 


GGGTCAATAT 


CACTCATGAC 


AATAAAAGAC 


14940 


ATAAATTAGG 


AGGAAGATAT 


AATCTATTCC 


CGCTTAAAAA 


TAAGGGGAAA 


TTAAGATTAT 


15000 


TATCACGAAG 


AT TAGTACT A 


AGCTGGATAT 


CATTATCCTT 


ATCAACCAGA 


TTACTGACGG 


15060 


GCCGTTTTCC 


R.GA.TGAAAAA 


TTTGAAAATA 


GGGCACAGAC 


CGGATATGTA 


TCATTGGCTG 


15120 


ATATTGATTT 


AGAATCCTTA 


AAGTTATTAT 


CAAGAAATAT 


TGTCAAAAAT 


TACAAAGAAC 


15180 


ACATAGGATT 


AATATCATAC 


TGGTTTTTGA 


CCAAAGAGGT 


CAAAATACTA 


ATGAAGCTTA 


15240 


TAGGAGGAGT 


CAAACTACTA 


GGAATTCCTA 


AACAGTACAA 


AGAGTTAGAG 


GATCGATCAT 


15300 


CTCAGGGTTA 


TGAATATGAT 


AATGAATTTG 


ATATTGATTA 


ATACATAAAA 


ACATAAAATA 


15360 


AAACACCTAT 


TCCTCACCCA 


TTCACTTCCA 


ACAAAATGAA AAGTAAGAAA 


AACAT GTAAT 


15420 


ATATATATAC 


CAAACAGAGT 


TTTTCTCTTG 


TTTGGT 






15456 
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FIG. 3A 

Mutagenesis to create restriction sites at start end stop condons of N 

— i HPIV3 N ~[ - H: ^/••/•■X^^^ 

I pUC119JSN I I pBS-KoN or pBS-SFN 1 

CAAAAATGTTG I GCAACTAATCGA CAAAAATGTTG I GCAACTAATCGA 
SEQ ID NO: 37 i SEQ ID NO: 38 SEQ ID NO: 37 i SEQ ID NO: 38 



HPIV3 N 



TAA CCATGG TGA 

pUC119JSN-Ncol/Aflll 
SEQ ID NO: 39 j 

Ncol A fill digestion 



SEQ ID NO: 40 ! SEQ ID NO: 39 

GCA CTTAAGC AC TAA CCATGG TGA 
Ncol 



1^ 



SEQ ID NO: 40 I 

GCACTTAAGCAC 



FIG 




. pUC119KaN-Ncol/Aflll 
! or pUC119SFN-Ncol/Aflll 

I + 

I Ncol A fill digestion ^ 

h:::-:-:m^:^:-Q(^ 



pUCn9B/HKaN-Ncol/Aflll or 
pUC119B/HSFN-Ncol/Aflll 



FIG. 30 

Mutagenesis to restore start and stop condon context 



Ncol 
TAA CCATGG TGA 
SEQ ID NO: 39 | 

CAAAAATGTTGA 
SEQ ID NO: 41 



A fill 

GCA CTTAAG CAC 
SEQ ID NO: 40 | 

GCAACTAGTCGA 
SEQ ID NO: 42 



-mm 

i pu 



pUC119B/HKaN or 
pUC119B/HSFN 



EcoRI 



LEGEND 

BPIV3 sequence 

□ HPIV3 sequence 



r 
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FIG. 4 A 

Mlu I Eco Rl 
I I 

I [:^ :N ]| pUC119B/HSFN 
^^pUC119B/HKaN 



T7 promoter 

rr 

aix: 



Legend 

I I HPIV3 sequence 
r • I BPIV3 Ka sequence 
Kj\y--.j BPIV3 SF sequences 
plasmid sequence 



IM I F IHN b;^^ 



T7 promoter 

^ few II P 



pLeft+2G 

i 



pLeltKoN or 
pLeltSFN 



FIG. 4B 



T7 terminator 
delta ribozyme 

I Ngo Ml 

V////A 



T7 promoter 



l^iwil p Nl 



T7 promoter 



Xbo\\ Ngo Ml- 



pLeftKoN or 
pLeftSFN 



mmmi 



\ U \ F I HN I 



T7 terminotor 
delta ribozyme 



pB/HPIV3KoN or 
pB/HPIV3SFN 
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FIG. 5 A 



SEQ ID NO: 


43 


rJS 


SEQ ID NO: 


44 


cKa 


SEQ ID NO: 


45 


cSF 


SEQ ID NO: 


46 


Ko 


SEQ ID NO: 


47 


SF 



GGAACTCTATAATTTCAAAAATGTTGAGCCTATTTGATAC 
GGAACTCTATAATTTCAAAAATGTTGAGTCTATTCGACAC 
GGAACTCTATAATTTCAAAAATGTTGAGTCTATTCGACAC 
GAAATCCTAAGACTGTAATCATGTTGAGTCTATTCGACAC 
GAAATCCTAAGACTGTAATCATGTTGAGTCTATTCGACAC 



FIG. 5B 



SEQ ID NO: 


48 


rJS 


SEQ ID NO: 


49 


cKo 


SEQ ID NO: 


50 


cSF 


SEQ ID NO: 


51 


Kq 


SEQ ID NO: 


52 


SF 



TTAACGCATTTGGAAGCAACTAATCGAATCAACATTTTAA 
TCAGTGGATTCGGAAGCAACTAGTCGAATCAACATTTTAA 
TCAGTGCATTCGGAAGCAACTAGTCGAATCAACATTTTAA 
TCAGTGCATTCGGAAGCAACTAGTCACAAAGAGATGACCA 
TCAGTGCATTCGGAAGCAACTAGTCACAAAGAGATGACCA 



FIG. 13 




0-] 1 1 1 1 1 1 1 

0 1 2 5 4 5 6 7 
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Confirmation of identity of potential BPIV3/HPIV3 chinneras by TaqI digestion 

Figure 6A 



rJS 


1 NP 1 


P 


1 M 1 


F 


1 HN 1 


L 1 
















cKa 


1 isr-il 


P 


1 M 1 


F 


1 HN 1 


L 
















cSF 


1 \im\\ 


P 


1 M 1 


F 


1 HN 1 


L 
















Ka 


1 NP 1 


P 


1 M 1 


F 


1 HN 1 


L 
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, - FIG. 8A 



I BPIV3 



rHPIV3 I 


' N 


P/C/O/V 


M 


F 


HN 


L 




rHPIV3-PB 


* H 




M 


F 


HN 


L 5' 



FIG. 8B 

rHPIV3 



rHPIV3-MB 
BPIV3 



3' N 


P/C/D/V 


M 


F 


HN 


L 5' 




5" N 


P/C/D/V 




F 


HN 


L 5^ 



FIG. 11 A 



rHPIV3-FBHNB 
rHPIV3-FHHNH 
BP1V3 Ka 



3' N 


P/C/D/V 


M 


F 


HN L 5' 


Sg, 


Al fo/WI 


3' N 


P/C/D/V 


M 






Sgi 


Al &/WI 








F 





CF1 = 

ev3 — 
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Figure 9A 



Figure 9B 



+ - (RT) 
M 1 2 



2027 




rHPIV3-PB 
Figure 9C 



1629 

rPiv3-pB 1: ^: :rb:obf 1 . 

2355(S) 

1 629 

rHPIV3 wt I I FDRF 



2322 - 

1353 
872 
603 
310 



rHPIV3-PB 



nts 1629-3784 (2156 bp) 



1430 bp 



2355 (S) 
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FigurelOA 



FigurelOC 

rPIV3-MB L 




1 M 2 3, 

I 



FigurelOB 



rHPIV3-MB 



rHPIV3-MB 



5073 nts 1629-5073 (3445 bp) 
_ J Spoi 



3454(S) 4559(S) 4810(S) 
A"X1Sb-^RK NTX-X-i ! 



1629 2295(e) 3691(e) 3912(e) 5073 nts 1629-5073 (3445 bp) 

rPIV3-MB I L_ ...__J____ji"CS31g5QBFX:X5Zl I FcoRV 

666 bp 1396 bp 221bp 1162 bp 

1629 2295(e) 3691(E) 5079 

rHPiV3 I i L...._..i::zrzH]DBEZz::i=n i EcoRV 

3894 (E) 4806 (£) 

BPIV3 ! ^ sixiXiaBDHE^X::^ ^ EcoRV 



1629 5073 nts 1629-5073 (3445 bp) 

rPIV3-MB I rx:SZSSfiSFS:X3:S i Xba\ sites 

2354 bp 1 441 bp i 650 bp 

3983(^ 442A{X) 

1629 , 5079 
rHPIVS I i H ORF : I no Xba\ sites 

BPIV3 I [S3:-'S:iftiX> B|Nrv x? I Xba\ sites 

39'65(X) 4406(X) 




l00Z'5-^["{ :3ica 3u![!j ['006/60 : on I^uas 
iZJocjoSnj SHMiDDVA. 

(Aid) snyiA vzN3mjNrvyVci oiygiMiHO 
3NiAoe-NVHnH aHxvnNaj-Lv -mi 

) so(nodopEi>is H ou^pv :jo]U3au[ 



T-T-intD CMcy <^ CO _ _ 

m^inco coo *^°Er o *^r:S 

•il II III I 

I I 

U ft • ^ fc 

I .. • It 

U II 

4ii n III t ^ 



lOOC 'C '^inf -^i^a °m\A Zl l'006/60 ; oN[ |EU3S 
LZ JO S3 oiJud S'JNIOOVA 

(Aid) sn^UA vzNan'i;mivavd oiynwiHO 

HNlAOa-NVlMflU a3J.VnN3JJ.V :3in.L 
in soinodoi>ci>is 11 OLlc^^| ;.ioiLt3AU[ 



= CN1 
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r FIG. 15 



3' N 


P/C/D/V 


M 


F 


HN 


L 5' 




i N 


P/C/D/V 


M 


F 


HN 





rBPIV3 Kansas 




FIG. 16 

I L START 

i SEQ ID NO: 61 rHPIV3 WT 8623 5' TAGGAGCAAAGCGTGCTCGGGAAATGGACACTGAATCTAACA 3' 8664 

I SEQ ID NO: 62 rHPIV3 Lb 8623 5' TAGGAGCAAAGCGTGCTCGGGAA ATGGACACCGAGTCCCACA 3' 8664 

I SEQ ID NO: 63 rBPIV3 wt 8617 5' TAGGAGAAAAGTGTGCAAGAAAAATGGACACCGAGTCCCACA 3' 8658 

] L STOP 

I SEQ ID NO: 64 rHPIV3 WT 15325 5' ATGATGAATTTGATATCGATTAAAACATAAATACAATGAAGA 3' 15366 

I SEQ ID NO- 65 rHPIV3 Lb 15325 5' ATAATGAATTTGATATTGATTAAT ACgMCGTACAATGAAGA 3' 15366 

I SEQ ID NO' 66 rBPIV3 wt 15319 5' ATAATGAATTTGATATTGATTAATACATAAAAACATAAAATA 3' 15360 



